SEQUENCE LISTING 



SEQ ID NO: 1 
HDEL (peptide) 

SEQ ID NO: 2 

Pichia pastoris OCH1 gene 

1 AGATCTGCCT GACAGCCTTA AAGAGCCCGC TAAAAGACCC GGAAAACCGA GAGAACTCTG 
61 GATTAGCAGT CTGAAAAAGA ATCTTCACTC TGTCTAGTGG AGCAATTAAT GTCTTAGCGG 
121 CACTTCCTGC TACTCCGCCA GCTACTCCTG AATAGATCAC ATACTGCAAA GACTGCTTGT 
181 CGATGACCTT GGGGTTATTT AGCTTCAAGG GCAATTTTTG GGACATTTTG GACACAGGAG 
241 ACTCAGAAAC AGACACAGAG CGTTCTGAGT CCTGGTGCTC CTGACGTAGG CCTAGAACAG 
301 GAATTATTGG CTTTATTTGT TTGTCCATTT CATAGGCTTG GGGTAATAGA TAGATGACAG 
361 AGAAATAGAG AAGACCTAAT ATTTTTTGTT CATGGCAAAT CGCGGGTTCG CGGTCGGGTC 
421 ACACACGGAG AAGTAATGAG AAGAGCTGGT AATCTGGGGT AAAAGGGTTC AAAAGAAGGT 
481 CGCCTGGTAG GGATGCAATA CAAGGTTGTC TTGGAGTTTA CATTGACCAG ATGATTTGGC 
541 TTTTTCTCTG TTCAATTCAC ATTTTTCAGC GAGAATCGGA TTGACGGAGA AATGGCGGGG 
601 TGTGGGGTGG ATAGATGGCA GAAATGCTCG CAATCACCGC GAAAGAAAGA CTTTATGGAA 
661 TAGAACTACT GGGTGGTGTA AGGATTACAT AGCTAGTCCA ATGGAGTCCG TTGGAAAGGT 
721 AAGAAGAAGC TAAAACCGGC TAAGTAACTA GGGAAGAATG ATCAGACTTT GATTTGATGA 
781 GGTCTGAAAA TACTCTGCTG CTTTTTCAGT TGCTTTTTCC CTGCAACCTA TCATTTTCCT 
841 TTTCATAAGC CTGCCTTTTC TGTTTTCACT TATATGAGTT CCGCCGAGAC TTCCCCAAAT 
901 TCTCTCCTGG AACATTCTCT ATCGCTCTCC TTCCAAGTTG CGCCCCCTGG CACTGCCTAG 
961 TAATATTACC ACGCGACTTA TATTCAGTTC CACAATTTCC AGTGTTCGTA GCAAATATCA 
1021 TCAGCCATGG CGAAGGCAGA TGGCAGTTTG CTCTACTATA ATCCTCACAA TCCACCCAGA 
1081 AGGTATTACT TCTACATGGC TATATTCGCC GTTTCTGTCA TTTGCGTTTT GTACGGACCC 
1141 TCACAACAAT TATCATCTCC AAAAATAGAC TATGATCCAT TGACGCTCCG ATCACTTGAT 
1201 TTGAAGACTT TGGAAGCTCC TTCACAGTTG AGTCCAGGCA CCGTAGAAGA TAATCTTCGA 
1261 AGACAATTGG AGTTTCATTT TCCTTACCGC AGTTACGAAC CTTTTCCCCA ACATATTTGG 
1321 CAAACGTGGA AAGTTTCTCC CTCTGATAGT TCCTTTCCGA AAAACTTCAA AGACTTAGGT 
1381 GAAAGTTGGC TGCAAAGGTC CCCAAATTAT GATCATTTTG TGATACCCGA TGATGCAGCA 
1441 TGGGAACTTA TTCACCATGA ATACGAACGT GTACCAGAAG TCTTGGAAGC TTTCCACCTG 
1501 CTACCAGAGC CCATTCTAAA GGCCGATTTT TTCAGGTATT TGATTCTTTT TGCCCGTGGA 
1561 GGACTGTATG CTGACATGGA CACTATGTTA TTAAAACCAA TAGAATCGTG GCTGACTTTC 
1621 AATGAAACTA TTGGTGGAGT AAAAAACAAT GCTGGGTTGG TCATTGGTAT TGAGGCTGAT 
1681 CCTGATAGAC CTGATTGGCA CGACTGGTAT GCTAGAAGGA TACAATTTTG CCAATGGGCA 
1741 ATTCAGTCCA AACGAGGACA CCCAGCACTG CGTGAACTGA TTGTAAGAGT TGTCAGCACG 
1801 ACTTTACGGA AAGAGAAAAG CGGTTACTTG AACATGGTGG AAGGAAAGGA TCGTGGAAGT 
1861 GATGTGATGG ACTGGACGGG TCCAGGAATA TTTACAGACA CTCTATTTGA TTATATGACT 
1921 AATGTCAATA CAACAGGCCA CTCAGGCCAA GGAATTGGAG CTGGCTCAGC GTATTACAAT 
1981 GCCTTATCGT TGGAAGAACG TGATGCCCTC TCTGCCCGCC CGAACGGAGA GATGTTAAAA 
2041 GAGAAAGTCC CAGGTAAATA TGCACAGCAG GTTGTTTTAT GGGAACAATT TACCAACCTG 
2101 CGCTCCCCCA AATTAATCGA CGATATTCTT ATTCTTCCGA TCACCAGCTT CAGTCCAGGG 
2161 ATTGGCCACA GTGGAGCTGG AGATTTGAAC CATCACCTTG CATATATTAG GCATACATTT 
2221 GAAGGAAGTT GGAAGGACTA AAGAAAGCTA GAGTAAAATA GATATAGCGA GATTAGAGAA 
2281 TGAATACCTT CTTCTAAGCG ATCGTCCGTC ATCATAGAAT ATCATGGACT GTATAGTTTT 
2341 TTTTTTGTAC ATATAATGAT TAAACGGTCA TCCAACATCT CGTTGACAGA TCTCTCAGTA 
2401 CGCGAAATCC CTGACTATCA AAGCAAGAAC CGATGAAGAA AAAAACAACA GTAACCCAAA 
2461 CACCACAACA AACACTTTAT CTTCTCCCCC CCAACACCAA TCATCAAAGA GATGTCGGAA 
2521 CACAAACACC AAGAAGCAAA AACTAACCCC ATATAAAAAC ATCCTGGTAG ATAATGCTGG 
2581 TAACCCGCTC TCCTTCCATA TTCTGGGCTA CTTCACGAAG TCTGACCGGT CTCAGTTGAT 
2641 CAACATGATC CTCGAAATGG GTGGCAAGCA TCGTTCCAGA CCTGCCTCCT CTGGTAGATG 
2701 GAGTGTTGTT TTTGACAGGG GATTACAAGT CTATTGATGA AGATACCCTA AAGCAACTGG 
2761 GGGACGTTCC AATATACAGA GACTCCTTCA TCTACCAGTG TTTTGTGCAC AAGACATCTC 
2821 TTCCCATTGA CACTTTCCGA ATTGACAAGA ACGTCGAC 

SEQ ID NO: 3 

Pichia pastoris alpha-1,6-mannosyltransferase encoded by the OCH1 gene
(nt 1027-2241 of SEQ ID NO: 2)
MAKADGSLLYYNPHNPPRRYYFYMAIFAVSVICVLYGPSQQLSSPKIDYDPLTLRSLDLKTLEAPSQLSPG
TVEDNLRRQLEFHFPYRSYEPFPQHIWQTWKVSPSDSSFPKNFKDLGES 

HHEYERVPEVLEAFHLLPEPILKADFFRYLILFARGGLYADMDTMLLKPIESWLTFNETIGGVKNNAG 
GIEADPDRPDWHDWYARRIQFCQWAIQSKRGHPALRELIWWSTTLRKEKSGYI^^ 
GPGIFTDTLFDYMTNVNTTGHSGQGIGAGSAYYNALSLEE

TNLRSPKLIDDILILPITSFSPGIGHSGAGDLNHHLAYIRHTFEGSWKD 
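
The coordinate note for SEQ ID NO: 3 (nt 1027-2241) can be checked against SEQ ID NO: 2 by translating that region. The following is a minimal Python sketch and is not part of the original listing. The helper names `clean` and `translate_region` are hypothetical, and the sketch assumes the standard genetic code and 1-based, inclusive coordinates. The usage lines translate only the short excerpt of SEQ ID NO: 2 that begins at nt 1021, so the start codon sits at position 7 of the excerpt.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(listing_text):
    """Keep only A/C/G/T, dropping position numbers, spaces and line breaks."""
    return "".join(ch for ch in listing_text.upper() if ch in "ACGT")


def translate_region(dna, start, end):
    """Translate nt start..end (1-based, inclusive), stopping at the first stop codon."""
    region = dna[start - 1:end]
    protein = []
    for i in range(0, len(region) - 2, 3):
        aa = CODON_TABLE[region[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpt of SEQ ID NO: 2 (the line starting at nt 1021).
excerpt = "1021 TCAGCCATGG CGAAGGCAGA TGGCAGTTTG"
print(translate_region(clean(excerpt), 7, 30))  # prints MAKADGSL
```

Note that `clean` silently discards any character that is not A, C, G or T, so positions lost to extraction damage shift the reading frame. The sketch is only meaningful on an intact copy of the nucleotide sequence.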

SEQ ID NO: 4 

5' GGAATTCAGCATGGAGTATGGATCATGGAGTCCGTTGGAAAGG

SEQ ID NO: 5 

5' GCCGCTCGAGCTAGCTTTCTTTAGTCC
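
A primer written 5' to 3' can be compared with the listed strand of a template by taking its reverse complement. The Python sketch below is an illustration only and is not part of the original listing. The function names are hypothetical, and the template excerpt is the line of SEQ ID NO: 2 that begins at nt 2221. It reports how many 3'-terminal bases of SEQ ID NO: 5 have a reverse complement that occurs in that excerpt.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string made of A, C, G, T."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_3prime_match(primer, template):
    """Length of the longest 3' primer suffix whose reverse complement occurs in template."""
    for n in range(len(primer), 0, -1):
        if reverse_complement(primer[-n:]) in template:
            return n
    return 0


primer = "GCCGCTCGAGCTAGCTTTCTTTAGTCC"  # SEQ ID NO: 5
# SEQ ID NO: 2, nt 2221-2260, with spacing removed.
template_excerpt = "GAAGGAAGTTGGAAGGACTAAAGAAAGCTAGAGTAAAATA"

print(longest_3prime_match(primer, template_excerpt))  # prints 17
```

On this excerpt the 17 bases at the 3' end of the primer match the opposite strand, while the remaining 5' bases have no counterpart in the excerpt.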

SEQ ID NO: 6 
Plasmid pGlycoSwitchM8 (2875 bp)

AGATCTAACATCCATAATCGATCTAAGCTATATTCGCCGTTTCTGTCATTTGCGTTTTGTACGGACCCTCA 
GJUVCAATTATCATCTCC^AAAAT^ 

AGCTCCTTCACAGTTGAGTCCAGGCTVCCGTAGAAGAT^ 

ACCGCAGTTACGAACCTTTTCCCCAACATATTTGGCAAACGTGGAAAGTTTCTCCCTCTGATAGTTCCTTT 

CCGAAAAACTTCAAAGACTTAGGTGAAAGTTGGCTGCAAAGGTC^

CGATGATGCAGCATGGGAACTTATTCACCATGAATACGAACGTGTACCAGAAGTCTTGGAAGCTTTTGATT 
TTAACGACTTTTAACGACAACTTGAGAAGATCAAAAAACAACTAATTATTCGCGAAACGAGGAATT^ 
GGCCCAGCCGGCCGTCTCGGATCGGTACCTCGAGCCGCGGCGGCCGCC^GCTTTCTAGAGT^C^AAAACTC 
ATCTCAGAAGAGGATCTGAATAGCGCCGTCGACCATCATCATCATCATCATTGAGTTTGTAGCCTTA 

TGACTGTTCCTC^GTTCAAGTTGGGCACTTACGAGAAGACCGGTCTTGCTAGATTCTAATCAAGAGG

CAGAATGCCATTTGCCTGAGAGATGCAGGCTTCATTTTTGATACTTTTTTATTTGTAACCTATATAGTATA 
GGATTTTTTTTGTCATTTTGTTTCTTCTCGTACGAGCTTGCTCCTGATCAGCCTATCTCGCAGCTGATGAA 
TATCTTGTGGTAGGGGTTTGGGAAAATCATTCGAGTTTGATGTTTTTCTTGGTATTTCCCACTCCTCTT^ 
GAGTACAGAAGATTAAGTGAGACCTTCGTTTGTGCGGATCCCCC^ 

CTCCTTTTTTACTCTTCCAGATTTTCTC^

GCATACTAAATTTTCCCTCTTTCTTCCTCTAGGGTGTCGTTAATTACCCGTACTAAAGGTTTGGAAAAGAA 
AAAAGAGACCGCCTCGTTTCTTTTTCTTCGTCGAAAAAGGC^TAAAAATTTTTATCACGTTTCTTTTTCT 
TGAAATTTTTTTTTTTAGTTTTTTTCTCTTTCAGTGACCTCCATTGATATTTAAGTTAATAAACGGTCTTC 
AATTTCTCAAGTTTCAGTTTCATTTTTCTTGTTCTATTACAACTTTTTTTACTTCTTGTTCATTAGAAAGA 

AAGCATAGCAATCTAATCTAAGGGGCGGTGTTGACAA

TACGACAAGGTGAGGAACTAAACCATGGCCAAGTTGACCA^ 

CGCCGGAGCGGTCGAGTTCTGGACCGACCGGCTCGGGTTCTCCCGGGACTTCGTGGAGGACGACTTCGCCG 
GTGTGGTCCGGGACGACGTGACCCTGTTCATC^GCGCGGT^ 

GCCTGGGTGTGGGTGCGCGGCCTGGACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACGAACTTCCG 
GGACGCCTCCGGGCCGGCCATGACCGAGATCGGCGAGCAGCCGTGGGGGCGGGAGTTCGCCCTGCGCGACC
CGGCCGGCAACTGCGTGCACTTCGTGGCCGAGGAGCAGGACTGACACGTCCGACGGCGGCCCACGGGTCCC 
AGGCCTCGGAGATCCGTCCCCCTTTTCCTTTGTCGATATCATGTAATTAGTTATGTCACGCTTACATTCAC 
GCCCTCCCCCCACATCCGCTCTAACCGAAAAGGAAGGAGTTAGACAACCTGAAGTCTAGGTCCCTATTTAT 

CGTGTACGCATGTAACATTATACTGAAAACCT

GCAAGCTGGAGACCAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAAC 
GCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGC^^ 

CCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCC 
TGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGT 
AGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGA
CCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCA 
CAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCT 
AACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAG 
AGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCA 



TTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAAC 
GAAAACTCACGTTAAGGGATTTTGGTCATGAGATC 

SEQ ID NO: 7 

T. reesei alpha-1,2-mannosidase protein sequence

SEQ ID NO: 8 

The ORF sequence of the MFManHDEL fusion in pGAPZMFManHDEL:

ATGAGATTTCCTTC^TTTTTACTGCTC 
TACAACAGAAGATGAAACGGGACAAATTCCGGC^

TCGATGTTGCTGTTTTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAAATACTACTATTGCC 

AGCATTGCTGCTAAAGAAGAAGGGGTATCTCTCGAGAAAAGAGAGGCTGAAGCTGAATTCGCCACAAAACG 

TGGATCTCCCAACCCTACGAGGGCGGCAGCAGTCAAC^ 

ATTTTGCCTTTCCCCATGACGACCTCC^CCCGGT 
TCGTCGGCAATCGATGGCTTGGACACGGCTATCCT

GTATGTACCGCAGATCAACTTCACCACGACTC 

TTCGGTACCTCGGTGGCCTGCTTTCTGCCTATGACCTGTTGCGAGGTCCTTTCAGCTCCTTGGCGACAAAC 
CAGACCCTGGTAAACAGCCTTCTGAGGCAGGCTC^ 

TCCCAGCGGTGTCCCGGACCCTACCGTCTTCTTCAACCCTACTGTCCGGAGAAGTGGTGCATCTAGCAACA 
ACGTCGCTGAAATTGGAAGCCTGGTGCTCGAGTGGACACGGTTGAGCGACCTGACGGGAAACCCGCAGTAT
GCCCAGCTTGCGCAGAAGGGCGAGTCGTATCTCCTGAATCCAAAGGGAAGCCCGGAGGCATGGCC 
GATTGGAACGTTTGTCAGCACGAGCT^CGGTACCTTTCAGGATAGCAGCGGCAGCTGGTCCGGCCTCATGG 
AC^GCTTCTACGAGTACCTGATCAAGATGTACCTGTACGACCCGGTTGCGTTTGCACACTACAAGGATCGC 
TGGGTCCTTGGTGCCGACTCGACCATTGGGC^TCTCGGCTCTCACCCGTCGACGCGCAAGGACTTGACCTT 
TTTGTCTTCGTACAACGGACAGTCTACGTCGCC^

TCATCTTGGGAGGCATTCTCCTGAACGAGCAAAAGTACATTGACTTTGGAATCAAGC 
TTTGGCACGTACACCCAGACGGCTTCTGGAATCGGCCCCGAAGGCTTCGCGTGGGTGGACAGCGTGACGGG 
CGCCGGCGGCTCGCCGCCCTCGTCCCAGTCCGGGTTCTACTCGTCGGCAGGATTCTGGGTGACGGCACCGT 
ATTACATCCTGCGGCCGGAGACGCTGGAGAGCTTGTACTACGCATACCGCGTCACGGGCGACTCCAAGTGG 
CAGGACCTGGCGTGGGAAGCGTTGAGTGCCATTGAGGACGCATGCCGCGCCGGCAGCGCGTACTCGTCCAT
CAACGACGTGACGCAGGCCAACGGCGGGGGTGCCTCTGACGATATGGAGAGCTTCTGGTTTGCCGAGGCGC 
TCAAGTATGCGTACCTGATCTTTGCGGAGGAGTCGGATGTGCA^ 
GTCTTTAACACGGAGGCGCACCCCTTTAGCATCCGTTCATCATCACGACGGGGCG^ 
CGAGTTG TAA 

SEQ ID NO: 9

pGlycoSwitch M5 (5485 bp):

AGATCTAACATCCATAATCGATCTAAGCTATATTCGCCGTTTCTGTCATTTGCGTTTTGTACGGACCCTCA 
CAACAATTATCATCTCCAAAAATAGACTATGATC 
AGCTCCTTCACAGTTGAGTCCAGGCACCGTAGAAGATAATCTTCGAAGACAATTGGAGTTTCAT^
ACCGCAGTTACGAACCTTTTCCCCAACATATTTGGCAAACGTGGAAAGTTTCTCCCTC^ 
CCGAAAAACTTCAAAGACTTAGGTGAAAGTTGGCTGCAAAGGTCCCCAAATTATGATCATTTTGT^
CGATGATGCAGCATGGGAACTTATTC^CCATGAATAC^ 

CTCACCGCAATGCTGTTAAGGTTCGTATGGAGAAACTGGGACTTATTTAATTATTTAGAGATTTTAACTTA 
CATTTAGATTCGATAGATCCACAGGACGGGTGTGGTCGCC^

GCGAAGCGAGGAGGACTGGGCGGCGGCCAAAGCGGTCGGACAGTGCTCCGAGAACGGGTGCG 

TGCATCAACGCATATAGCGCTAGCAGCACGCCATAGTC^ 

GAAGAGGCCCGGCAGTACCGGCATAACCAAGCCTATGCCTA^ 

CGATGAGCGCATTGTTAGATTTCATACACGGTGCCTGACTGCGTTAGCAATTTAACTGTGATAAACTACCG 
CATTAAAGCTGATCTTTTTTGTAGAAATGTCTTGGTGTC

CTGGCTCCGTTGCAACTCCGAACGACCTGCTGGCAACGTAAAATTCTCCGGGGTAAAACTTAAATGTGGAG 
TAATGGAACCAGAAACGTCTCTTCCCTTCTCTCTCCTTCCACCGCCCGTTACCGTCCCTAGGAAATTTTAC 
TCTGCTGGAGAGCTTCTTCTACGGCCCCCTTGCAGCAATGCTCTTCCCAGCATTACGTTGCGGGTAAAACG 
GAGGTCGTGTACCCGACCTAGCAGCCCAGGGATGGAAAAGTCCCGGCCGTCGCTGGCAATAATAGCGGGCG 
GACGC^TGTCATGAGATTATTGGAAAC(^CCAGAATCGAATATAAAAGGCGAACACCTTTCCa^TTTTGG
TTTCTCCTGACCCAAAGACTTTAAATTTAAT^ 

GAAACGATGAGATTTCCTTCAATTTTTACTGCTGTTTTATTCGCAGCATCCTCCGCATTAGCTGCTCCAGT 
CAACACTACAACAGAAGATGAAACGGCACAAATTC^ 
GGGATTTCGATGTTGCTGTTTTGCCATTTTCCAACAGCACAAATAACGGGTTATTGTTTATAAATACTACT
ATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTATCTCTCGAGAAAAGAGAGGCTGAAGCTGAATTCGCCAC 
AAAACGTGGATCTCCCAACCCTACGAGGGCGGCAGCAGTCAAGGCCGCATTCCAGACGTCGTGGAA 
ACCACCATTTTGCCTTTCCCCATGACGACCTC 
TGGGGCTCGTCGGCAATCGATGGCTTGGACACGG 
CCTTCAGTATGTACCGCAGATCAACTTC^

CCAACATTCGGTACCTCGGTGGCCTGCTTTCTGCCTATGACCTGTTGCGAGGTCCTTTCAGCTCCTTGGCG 
ACAAACCAGACCCTGGTAAACAGCCTTCTGAGGCAGGCT 

CACCACTCCCAGCGGTGTCCCGGACCCTACCGTCTTCTTCAACCCTACTGTCCGGAGAAGTGGTGCATCTA 
GC^CAACGTCGCTGAAATTGGAAGCCTGGTGCTCGAGTGGACACGGTTGAGCGACCTGACGGGAAACCCG 
CAGTATGCCCAGCTTGCGCAGAAGGGCGAGTCGTATCT^
TGGCCTGATTGGAACGTTTGTCAGCACGAGCAACGGTA 

TCATGGAC^GCTTCTACGAGTACCTGATCAAGATGTACCTGTACGACCCGGTTGCGTTTGCACACTAa^G 
GATCGCTGGGTCCTTGGTGCCGACTCGACCATTC^^ 

GACCTTTTTGTCTTCGTACAACGGACAGTCTACGTCGCCAAACTCAGGACATTTGGCCAGT 
GCAACTTC^TCTTGGGAGGCATTCTCCTGAAC^

TCGTACTTTGGCACGTACACCCAGACGGCTTCTGGAATCGGCCCCGAAGGCTTCGCGTGGGTGGACAGCGT 

GACGGGCGCCGGCGGCTCGCCGCCCTCGTCCCAGTCCGGGTTCTACTCGTCGGCAGGATTCTGGGTGACGG 

CACCGTATTACATCCTGCGGCCGGAGACGCTGGAGAGCTTGTACTACGCATACCGCGTCACGGGCGACTCC 

AAGTGGCAGGACCTGGCGTGK^AAGCGTTGAGTGCCAT^ 
GTCCATC^CGACGTGACGCAGGCCAACGGCGGGGGTGCCTCTGACGATATGGAGAGCTTCT

AGGCGCTCAAGTATGCGTACCTGATCTTTGCGGAGGAGTCGGATGTGCAGGTGCAGGCCACCGGCGGGAAC 

AAATTTGTCTTTAACACGGAGGCGCACCCCTTTAG^^ 

TCACGACGAGTTGTAATCTAGGGCGGCCGCCAGCTTTCTAGAGAACAAAAACTCATCTCAGAAGAGGATCT 
GAATAGCGCCGTCGACCATCATCATCATCATCATT
AAGTTGGGCACTTACGAGAAGACCGGTCTTGCTAGATTCTAATCAAGAGGATGTCAGAATGCCATTTGCCT
GAGAGATGCAGGCTTCATTTTTGATACTTTTTTATTTGTAACCTATATAGTATAGGATTTTTTTTGTCATT 
TTGTTTCTTCTCGTACGAGCTTGCTCCTGATCAGCCTATCTCGCAGCTGATGAATATCTTGTGGTAGGGGT 
TTGGGAAAATCATTCGAGTTTGATGTTTTTCT 

TGAGACCTTCGTTTGTGCGGATCCCCCACACACCATAGCTTCAAAATGTTTCTACTCCTTTTTTACTCTTC 
CAGATTTTCTCGGACTCCGCGCATCGCCGTACCACTT

TCTTTCTTCCTCTAGGGTGTCGTTAATTACCCGTACTAAAGGTTTGGAAAAGAAAAAAGAGACCGCCTCGT 

GTTTTTTTCTCTTTCAGTGACCTCC^ 

TTTCATTTTTCTTGTTCTATTACAACTTTTTTTACTTCTTGTTCATTAGAAAGAAAGCATAGCAATCTAAT 
CTAAGGGGCGGTGTTGACAATTAATCATCGGC^
CTAAACCATGGCCAAGTTGACCAGTGCTC 

TCTGGACCGACCGGCTCGGGTTCTCCCGGGACTTCGTGGAGGACGACTTCGCCGGTGTGGTCCGGGACGAC 
GTGACCCTGTTCATCAGCGCGGTCCAGGACCAGGTGGTGC^ 

CGGCCTGGACGAGCTGTACGCCGAGTGGTCGGAGGTCGTGTCCACGAACTTCCGGGACGCCTCCGGGCCGG 
CCATGACCGAGATCGGCGAGCAGCCGTGGGGGCGGGAGTTCGCCCTGCGCGACCCGGCCGGCAACTGCGTG
CACTTCGTGGCCGAGGAGCAGGACTGACACGTCCGACGGCGGCCCACGGGTCCCAGGCCTCGGAGATCCGT 
CCCCCTTTTCCTTTGTCGATATCATGTAATTAGTTATGTCACGCTTACATTCACGCCCTCCCCCCACATCC 
GCTCTAACCGAAAAGGAAGGAGTTAGACAACCTGAAGTCTAGGTCCCTATTTATTTTTTTATAGTT^ 
AGTATTAAGAACGTTATTTATATTTCAAATTTTTCTTTTTTTTCTGTACAGACGCGTGTACGCATGTAACA
TTATACTGAAAACCTTGCTTGAGAAGGTTTTGGGACGCTCG

ATGTGAGC^AAAGGCC^GCAAAAGGCCTIGGAACCGTAAAAAGGCCGCGTTGCTG^ 
CCGCCCCCCTGACGAGC^TCACAAAAATCGACGCTC^ 

GATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATAC 
CTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGT 
GTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCG
GTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGC 

ATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAG 
AAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGAT 
CCGGCAAACAAACGACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAG^

GGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTC^CGTTAA^ 
GATTTTGGTCATGAGATC 

SEQ ID NO: 10 
S. cerevisiae Kre2 protein

MALFLSKRLLRFTVIAGAVIVLLLTLNSNSRTQQYIPSSIAAFDFTSGSISPEQQVISEENDAKKLEQSAL

NSEASEDSEAMDEESKALKAAAEKADAPIDTKTTMDYITPSFANKAGKPKACYV^ 

VENKINKKFPYPWVFLNDEPFTEEFKEAVTKAVSSEVKFGILPKEHW 

GGSESYRHMCRYQSGFFWRHELLEEYDWYWRVEPDIKLYCDINYDVFKWM^ 
TLWQTSMDFIKKNPEYLDENNLMSFLSNDNGKTYNLCHFWSNFEIANLNLWR

ERWGDAPVHSIAAALFLPKDKIHYFSDIGYHHPPYDNCPLDKEVYNSNNCECDQGNDFTFQGYSCGKEYYD 

AQGLVKPKNWKKFRE 

SEQ ID NO: 11 

S. cerevisiae Kre2 Golgi localization signal (first 100 amino acids)

MALFLSKRLLRFTVIAGAVIVLLLTLNSNSRTQQYIPSSISAAFDFTSGSISPEQQVISEENDAKKLEQSA
LNSEASEDSEAMDEESKALKAAAEKADAP 

SEQ ID NO: 12 
Human GnTI cDNA

ATGCTGAAGAAGCAGTCTGCAGGGCTTGTGCTGTGGGGCGCTATCCTCTTTGTGGCCTGGAATGCCCTGCT 

GCTCCTCTTCTTCTGGACGCGCCCAGCACCTGGCAGGCCACCCTCAGTCAGCGCTCTCGATGGCGA 

CCAGCCTCACCCGGGAAGTGATTCGCCTGGCCCAAGAC^ 

CTGCAGCAGATCGGGGATGCCCTGTCGAGCCAGCGGGGGAGGGTGCCCACCGCGGCCCCTCCCGCCCAGCC 

GCGTGTGCCTGTGACCCCCGCGCCGGCGGTGATTCCCATCCTGGTCATCGCCTGTGACCGCAGCACTGTTC
GGCGCTGCCTGGACAAGCTGCTGCATTATCGGCCCTCGGCTGAGCTCTTCCCC^TCATCGTTAGCCAGGA 
TGCGGGCACGAGGAGACGGCCCAGGCCATCGCCTCCTACGGCAGCGCGGTCACGCACATCCGGCAGCCCGA 
CCTGAGCAGCATTGCGGTGCCGCCGGACCACCGCAAGTTCCAGGGCTACTACAAGAT 
GCTGGGCGCTGGGCCAGGTCTTCCGGCAGTTTCGCTTCCCCGCGGCCGTGGTGGTGGAGGATGACCTGGAG 

GTGGCCCCGGACTTCTTCGAGTACTTTCGGGCCACCTATCCGCTGCTGAAGGCCGACCCCTCCCTGTGGTG
CGTCTCGGCCTGGAATGACAACGGCAAGGAGCAGATGGTGGACGCCAGCAGGCCTGAGCT
CCGACTTTTTCCCTGGCCTGGGCTGGCTGCTGTTGGCCGAGCTCTGGGCTGAGCTGGAGCCO^GTGGCCA 
AAGGCCTTCTGGGACGACTGGATGCGGCGGCCGGAGCAGCGGCAGGGGCGGGCCTGCATACGCCCTGAGAT 
CTCAAGAACGATGACCTTTGGCCGCAAGGGTGTGAGCCACGGGC^ 

TCAAGCTGAACCAGCAGTTTGTGCACTTCACCCAGCTGGACCTGTCTTACCTGCA

CGAGATTTCCTCGCCCGCGTCTACGGTGCTCCCCAGCTGCAGGTGGAGAAAGTGAGGACCAATGACCGGAA 
GGAGCTGGGGGAGGTGCGGGTGCAGTATACGGGCAGGGACAGCTTCAAGGCTTTCGCCAA 
TCATGGATGACCTTAAGTCGGGGGTTCCGAGAGCTGGCTACCGGGGTATTGTCACCTTCCAGTTCCGGGGC 
CGCCGTGTCCACCTGGCGCCCCCACCGACGTGGGAGGGCTATGATCCTAGCTGGAATATGCTGAAGAAG^ 

GTCTGCAGGGCTTGTGCTGTGGGGCGCTATCCTCTTTGTGGCCTGGAATGCCCTGCTGCTCCTCTTCTTCT
GGACGCGCCCAGCACCTGGCAGGCC^CCCTCAGTCAGCG 

GAAGTGATTCGCCTGGCCCAAGACGCCGAGGTGGAGCTGGAGCGGCAGCGTGGGCTGCTGCAGCAGATCGG 
GGATGCCCTGTCGAGCCAGCGGGGGAGGGTGCCCACCGCGGCCCCTCCCGCCCAGCCGCGTGTGCCTGTGA 
CCCCCGCGCCGGCGGTGATTCCCATCCTGGTCATCGCCTGTGACCGCAGCACTGTTCGGCGCTGCCTGGAC 
AAGCTGCTG(^TTATCGGCCCTCGGCTGAGCTCTTCCCCATCATCGTTAGCCAGGACTGCGGGCACGAG^
GACGGCCCAGGCCATCGCCTCCTACGGCAGCGCGGTCACGC^ 
CGGTGCCGCCGGACCACCGCAAGTTCCAGGGCTAC 

CAGGTCTTCCGGCAGTTTCGCTTCCCCGCGGCCGTGGTGGTGGAGGATGACCTGGAGGTGGCCCCGGACTT 
CTTCGAGTACTTTCGGGCCACCTATCCGCTGCTGAAGGCCGACCCCTCCCTGTGGTGCGTCTCGGCCTGGA 



ATGACAACGGC^GGAGCAGATGGTGGACGCC^GCAGGCCTGAGCTGCTCTACCGaiCCGACTTTTTCCCT 
GGCCTGGGCTGGCTGCTGTTGGCCGAGCTCTGGGCTGAGCTGGAGCCCAAGTGGCCAAAGGCCTTCTGGGA 
CGACTGGATGCGGCGGCCGGAGCAGCGGCAGGGGCGGGCCTGCATACGCCCTGAGATCTCAAGAACGATGA 
CCTTTGGCCGCAAGGGTGTGAGCCACGGGCAGTTC^^ 
C^GTTTGTGCACTTCACCCAGCTGGACCTGTCTTACCTGCAGCGGGAGGCCTATGACCGAGATTTCCTCGC
CCGCGTCTACGGTGCTCCCC^GCTGC^GGTGGAGAAAGTGAGGACC^ATGACCGGAAGGAGCTGGGGGAGG 
TGCGGGTGCAGTATACGGGCAGGGACAGCTTCAAGGCTT^ 

AAGTCGGGGGTTCCGAGAGCTGGCTACCGGGGTATTGTCACCTTCCAGTTCCGGGGCCGCCGTGTCCACCT 
GGCGCCCCCACCGACGTGGGAGGGCTATGATCCTAGCTGGAAT 

SEQ ID NO: 13 
Human GnTI protein 

1 MLKKQSAGLV LWGAILFVAW NALLLLFFWT RPAPGRPPSV SALDGDPASL TREVIRLAQD
61 AEVELERQRG LLQQIGDALS SQRGRVPTAA PPAQPRVPVT PAPAVIPILV IACDRSTVRR
121 CLDKLLHYRP SAELFPIIVS QDCGHEETAQ AIASYGSAVT HIRQPDLSSI AVPPDHRKFQ
181 GYYKIARHYR WALGQVFRQF RFPAAVVVED DLEVAPDFFE YFRATYPLLK ADPSLWCVSA
241 WNDNGKEQMV DASRPELLYR TDFFPGLGWL LLAELWAELE PKWPKAFWDD WMRRPEQRQG
301 RACIRPEISR TMTFGRKGVS HGQFFDQHLK FIKLNQQFVH FTQLDLSYLQ REAYDRDFLA
361 RVYGAPQLQV EKVRTNDRKE LGEVRVQYTG RDSFKAFAKA LGVMDDLKSG VPRAGYRGIV
421 TFQFRGRRVH LAPPPTWEGY DPSWN



SEQ ID NO: 14 
pPIC6AKrecoGnTI 

GAAATTTTTTTTTTTAGTTTTTTTCTCTTTCAGTGACCTCCATTGATATTTAAGTTAATAAACGGTCTTCA 
ATTTCTCAAGTTTCAGTTTCATTTTTCTTGTTCTATTACAACTTTTTTTACTTCTTGTTCATTAGAAAGAA 
AGCATAGCAATCTAATCTAAGGGGCGGTGTTGACAAT
ACGACAAGGTGAGGAACTAAACCATGGCCAAGCCTTTGTCT 

ACGGCTACAATCAACAGCATCCCGATCTCTGAAGACT

CCGCATCTTCACTGGTGTCAATGTATATCATTTTACTGGGGGACCTTGTGCAGAACTCGTGGTGCTGGG^ 
CTGCTGCTGCTGCGGCAGCTGGCAACCTGACTTGTATCGTCGCGATCGGAAATGAGAACAGGGGCATCTTG 
AGCCCCTGCGGACGGTGCCGAC7VGGTGCTTCTCGATCTGCATCCTGGGATCAAAGCCATAGTGAAGGAC^^ 
TGATGGACAGCCGACGGCAGTTGGGATTCGTGAATTGCTC 

TTCGTGGCCGAGGAGCAGGACTGACACGTCCGACGGCGGCCCACGGGTCC(^GGCCTCGGAGATCCGTCC^
CCTTTTCCTTTGTCGATATCATGTAATTAGTTATGTCACGCTTACATTCACGCCCTCCCCCCACATCCGCT 
CTAACCGAAAAGGAAGGAGTTAGACAACCTGAAGTCTAGGTCCCTATTTATTTTTTTATAGTTATGTTAGT 

TACTGAAAACCTTGCTTGAGAAGGTTTTGGGACGCTCGAAGGCT^ 
TGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCG^
CCCCCCTGACGAGCATGACAAAAATCGACGCT 

ACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTG 
TCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTA 
GGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTA 
ACTATCGTCTTGAGTCCAACCCGGTAAGACACGACT^

AGGAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAG 
GACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCG 
GCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCT^^ 

TCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTC^CGTTAAGGGAT 
TTTGGTCATGAGATGAGATCTAAGATC

ACAGGTCGATTCTCACACATAAGTGCCAi^ 

GCAGGACCTCCACTCCTCTTCTCCTCAAC^CCCACT^ 

GATTGGAGCTCGCTCATTCCAATTCCTTCT^ 

TGGCCCCCCTGGCGAGGTTCATGTTTGTTTA 
CTCCAGATGAGGGCTTTCTGAGTGTGGGGTCAAATAGTTT^^



TTAAACGCTGTCTTGGAACCTAATATGACAAAAGCGTGATCTCATCCAAGATGAACTAAGTTTGGTTCGTT 
GAAATGCTAACGGCCAGTTGGTCAAAAAGAAACTTCCAAAA 

GATTGACGAATGCTCAAAAATAATCTCATTAATGCTTAGCGCAGTCTCTCTATCGCTTCTGAACCCCGGTG 
CACCTGTGCCGAAACGCAAATGGGGAAA(^CCCGCT^ 
CTTCCAAGATTCTGGTGGGAATACTGCTGATAGCCT

TACTTGACAGCAATATATAAACAGAAGGAAGCTGCCCTGTCTTAAACCTTTTTTTTTATCATCATTATTAG 
CTTACTTTCATAATTGCGACTGGTTCCAATTGACAAGCTTTTGATTTTAACGACTTTTAACGACAACTTGA 
GAAGATCAAAAAACAACTAATTATTCGAAACGAGGAATTATC 

ATGGCCCTCTTTCTCAGTAAGAGACTGTTGAGATTTACCGTCATTGCAGGTGCGGTTATTGTTCTCCTCCT 
AACATTGAATTCCAACAGTAGAACTC^GCAATATAT^
CAGGATCTATATCCCCTGAACAACAAGTCATCTC^ 

CTGAATTCAGAGGCAAGCGAAGACTCCGAAGCCATGGATGAAGAATCCAAGGCTCTGAAAGCTGCCGCTGA 
AAAGGCAGATGCCCCGCCGGCGGTGATTCCCATCCTGGTCATCGCCTGTGACCGCAGCACTGTTCGGCGCT 
GCCTGGACAAGCTGCTGCATTATCGGCCCTCGGCT 
CACGAGGAGACGGCCCAGGCCATCGCCTCCTACGGCAGC^
CAGC^TTGCGGTGCCGCCGGACCACCGCAAGTTC 

CGCTGGGCCAGGTCTTCCGGCAGTTTCGCTTCCCCGCGGCCGTGGTGGTGGAGGATGACCTGGAGGTGGCC 
CCGGACTTCTTCGAGTACTTTCGGGCCACCTATCCGCTGCTGAAGGCCGACCCCTCCCTGTGGTGCGTCTC 
GGCCTGGAATGACAACGGCAAGGAGCAGATGGTGGACGCCAGCAGGCCTGAGCTGCT 
TTTTCCCTGGCCTGGGCTGGCTGCTGTTGGC^

TTCTGGGACGACTGGATGCGGCGGCCGGAGCAGCGGCAGGGGCGGGCCTGC^TACGCCCTGAGATCTCAAG 

AACGATGACCTTTGGCCGCAAGGGTGTGAGCCAC 

TGAACC^GCAGTTTGTGCACTTCACCCAGCT 

TTCCTCGCCCGCGTCTACGGTGCTCCCCAGCTGCAGGTGGAGAAAGTGAGGACCAATGACCGGAAGGAGCT 
GGGGGAGGTGCGGGTG(^GTATACGGGCAGGGACAGCTTCAAGGCTTTCGCCAAGGCTCT

ATGACCTTAAGTCGGGGGTTCCGAGAGCTGGCTACCGGGGTATTGTCACCTTCCAGTTCCGGGGCCGCCGT 
GTCCACCTGGCGCCCCCACCGACGTGGGAGGGCTATGATCCTAGCTGGAATTAGCACCTGTCGACTGGAGA 
CCTGCAGGCATGCAAGCTTCGACCATCATCATCATCATCATTGAGTTTGTAGCCT 
TCAGTTCAAGTTGGGGACTTACGAGAAGACCGGTCTTC 
TTTGCCTGAGAGATGCAGGCTTCATTTTTGATACTTTTTTATTTGTAACCTATATAGTATAGGATTTTTTT
TGTCATTTTGTTTCTTCTCGTACGAGCTTGCTCCTGATCAGCCTATCTCGCAGCTGATGAATATCTTGTGG 
TAGGGGTTTGGGAAAATCATTCGAGTTTGATGTTTTTCTTGGTATTTCCCACTCCTCTTCAGAGTACAGAA 
GATTAAGTGAGACCTTCGTTTGTGCGGATCCCCCACACACCATAGCTTCAAAATGTTTCTACTCCTTTTTT 
ACTCTTCCAGATTTTCTCGGACTCCGCGCATCGCC^ 
TTTTCCCTCTTTCTTCCTCTAGGGTGTCGTTAATTACCCGTACTAAAGGTTTGGAAAAGAAAAAAGAGACC
GCCTCGTTTCTTTTTCTTCGTCGAAAAAGGCAATAAAAATTTTTATCACGTTTCTTTTTCTT 

SEQ ID NO: 15 

5' TTCGAAGCTTCGCTAGCTCGGTGTCCCGATGTC

SEQ ID NO: 16 

5' GAATTCGAAGGGAAGATGAGGCTTCGGGAGCC

SEQ ID NO: 17 
5' CGTTCGCGACCGGAGGGGCCCGGCCGCC

SEQ ID NO: 18 

5' TCGATATCAAGCTTAGCTCGGTGTCCCGATGTC

SEQ ID NO: 19

5' GAATTCGAACTTAAGATGGCCCTCTTTCTCAGTAAG

SEQ ID NO: 20 

Human GalTI cDNA:

ATGAGGCTTCGGGAGCCGCTCCTGAGCGGCGCCGCGATGCCAGGCGCGTCCCTACAGCGGGCCTGCCGCCT 
GCTCGTGGCCGTCTGCGTCTGGCACCTTGGCGTCACCCTCGTTTACTACCTGGCTGGCCGCGACCTGAGCC 
GCCTGCCCCAACTGGTCGGAGTCTCCACACCGCTC 
TCCTCCGGGGAGCTCCGGACCGGAGGGGCCCGGCCGCCGCCTCCTCTAGGCGCCTCCTCCCAGCCGCGCCC
GGGTGGCGACTCCAGCCCAGTCGTGGATTCTGGCCCTGGCCCCGCTAGCAACTTGACCTCGGTCCCAGTGC 
CCCACACCACCGCACTGTCGCTGCCCGCCTGCCCTGAGGAGTCCCCGCTGCTTGTGGGCCCCATGCTGATT 
GAGTTTAACATGCCTGTGGACCTGGAGCTCGTGGCA 
TGCCCCCAGGGACTGCGTCTCTCCTCACAAGGTGGCC^^ 
TG^GTACTGGCTATATTATTTGCACCCAGTCCT^

AACCAGGCGGGAGACACTATATTCAATCGTGCTAAGCTCCTCAATGTTGGCTTTCAAGAAGCCTTGAAGGA
CTATGACTACACCTGCTTTGTGTTTAGTGACGTGGACCTCATTCCAATGAATGACCATAATGCGTACAGGT 
GTTTTTCACAGCCACGGCACATTTCCGTTGCAATGGATAAGTTTGGATTCAGCCT 

TTTGGAGGTGTCTCTGCTCTAAGTAAA(^(^GTTTCTAACC^TCAATGGATTTCCTAATAATTATTGG^ 
CTGGGGAGGAGAAGATGATGACATTTTTAACAGATTAGTTTTTAGAGGCATGTCTATATCT^
CTGTGGTCGGGAGGTGTCGCATGATCCGCC^CTCAAGAGACAAAAAAAATGAACCCAATCCT 
GACCGAATTGCACACACAAAGGAGACAATGCTCTCTGATGGTTTGAACTCACT 
TGTACAGAGATACCCATTGTATACCCAAATCACAGTGGAC^TCGGGACACCGAGC 

SEQ ID NO: 21

Human GalTI protein:

MRLREPLLSGAAMPGASLQRACRLLVAVCVWHLGVTLVYYLAGRDLSRLPQLVGVS 

SSGELRTGGARPPPPLGASSQPRPGGDSSPVVDSGPGPASNLTSVPVPHTTALSLPACPEESPLLVGPMLI
EFNMPVDLELVAKQNPNVKMGGRYAPRDCVSPHKVAIIIPFRNRQEHLKYWLYYLHPVLQRQQLDYGIYVI
NQAGDTIFNRAKLLNVGFQEALKDYDYTCFVFSDVDLIPMNDHN

FGGVSALSKQQFLTINGFPNNYWGWGGEDDDIFNRLVFRGMSISRPNAWGRCRMIRHSRDKKNEPNPQRF
DRIAHTKETMLSDGLNSLTYQVLDVQRYPLYTQITVDIGTPS 



SEQ ID NO: 22

pBlKanMX4KrehGalT:

CTAGTGCACAAACGAACGTCTCACTTAATCTTCTC 

TCAAACTCGAATGATTTTCCCAAACCCCTACCACAAGATATTCATCAGCTGCGAGATAG^ 
CAAGCTCGTACGAGAAGAAACAAAATGACAAAAAAAATC
CAAAAATGAAGCCTGCATCTCTCAGGCAAATG^

GTCTTCTCGTAAGTGCCCAACTTGAACTGAGGAACAGTCATGTCTAAGGCGAATTATCAAGCTTAGCTCGG 
TGTCCCGATGTCCACTGTGATTTGGGTATACAATG 

AGTTCAAACCATCAGAGAGCATTGTCTCCTTTGTGTGTGCAATTCGGTCAAACCTCTGAGGATTGGGTT^ 
TTTTTTTTGTCTCTTGAGTGGCGGATCATGCGACACCTCCCGACCACAGCATTTGGGCGAGATATAG^ 
GCCTCTAAAAACTAATCTGTTAAAAATGTCATCATCTTCTCCTCCCCAGCCCCAATAATTATT^
CATTGATGGTTAGAAACTGTTGTTTACTTAGAGCAGAGACACCTCCAAAATACTGAACATAAG^ 
AATCCAAACTTATCCATTGCAACGGAAATGTGCCGTGG 
CATTGGAATGAGGTCCACGTCACTAAACACAAAGCAGGTGTAGTCATA

CAACATTGAGGAGCTTAGCACGATTGAATATAGTGTCTCCCGCCTGGTTGATAACATAGATGCCATAGTCC 
AGCTGCTGGCGCTGCAGGACTGGGTGCAAATAATATAGCCAGTACTTGAGGTGCTCCTGCCGGTTGCGGAA

TGGAATGATGATGGCCACCTTGTGAGGAGAGACGCAGTCCCTGGGGGCATAGCGGCCGCCCATCTT 

TTGGGTTCTGCTTTGCCACGAGCTCCAGGT^ 

AGCGGGGACTCCTGAGGGCAGGCGGGCAGCGACAGT^ 

GCTAGCGGGGCCAGGGCCAGAATCGACGACTGGGCT^ 
CTAGAGGAGGCGGCGGCCGGGCCCCTCCGGTCGCCGG

GCCTTGGATTCTTC^TCCATGGCTTCGGAG 

TTTAGCATCATTTTCCTC^GAGATGACTTGOT^ 

CGGAGATGGAACTCGGAATATATTGCTGAGTTCTACTGTTGGAATTCAATGTTAGGAGGAGAACAATAACC 
GCACCTGCAATGACGGTAAATCTCAACAGTCTCTTACTGAGAAAGAGGGCCATCTTAAGTTCGAATAATTA 



GTTGTTTTTTGATCTTCTC^GTTGTCGTTAAAAGTCGTTAAAATCAAAAGCTTGTCAATTGGAACCAGTC 
GCAATTATGAAAGTAAGCTAATAATGATGATAAAAAAAAAGGTTTAAGACAGGGCAGCTTCCTTCTGTTTA 
TATATTGCTGTCAAGTAGGGGTTAGAACAGTTAAATTTT 
GACCAGAATCTTGGAAGCATACAATGTGGAGACAATGC^ 
GCGTTTCGGCACAGGTGC^CCGGGGTTCAGAAGCGATAGAGAGACTGCGCTAAG(^TTAATGAGATTATT^
TTGAGCATTCGTCAATCAATACC^AACAAGACAAACGGTATGCCGACTTTTGGAA^^ 
CTGGCCGTTAGCATTTCAACGAACCAAACTTAGTTCATCTTGGATGAGATCACGCTTTTGTCAT^ 
TCCAAGACAGCGTTTAAACTGTCAGTTTTGGGCCATTTGGGGAAC^TGAAACTATTTGACCC 
AAAGCCCTCATCTGGAGTGATGTTCGGGTGTAATGCGGAGCTTGTTGC^TTCGGAAATAAACAAACATG^ 
CCTCGCCAGGGGGGCCAGGATAGACAGGCTAATAAAGTCATGGTGTTAGTAGCCTAATAGAAGGAATTGGA
ATGAGCGAGCTCCAATCAAGCCCAATAACTGGGCTGGTTTT^ 

AGGAGTGGAGGTCCTGCGTTTGCAACGGTCTGCTGCTAGTGTATCCCCTCCTGTTGCGTTTGGCACTTATG 
TGTGAGAATGGACCTGTGGATGTCGGATGGCAAAAAGGTTTCATTCAACCTTTCGTCTTTGGATGTTAGCT 
AGCCGGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTC 
CTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAA
TACGGTTATCCAGAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGG 

AACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCC^TAGGCTCCGCCCCCCTGACGAGCATC^CAAAAATCG 

ACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACC^GGCGTTTCCCCCTGGAAGCTC 

TCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTG 

GCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGT
GCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAA 
GACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCT 
ACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCT 
GAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTG 

GTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCT^^

ACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGAT 
CTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGT 
CTGACAGTTACCAATGCTTAATCAGTGAGGCACCTAT^ 

GCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGAT 

ACCGCGAGACCCACGCTCACCGGCTCC^GATTTAT

GAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGT 
TCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGG 
TATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCAT 
CGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATG 

GCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAAC
CAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGT^ 

CGCCAC^TAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCA^ 
TTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTG<^CCCAACTGATCTTCAGCATCTTTTACTT^ 
CACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAA^TGCCGCAAA 
AATGTTGAATACTCATACTCTTCCTTTTTCAATGCTTCTAGTGGTAGGAATTAATTCTGTACCGGTTTACA
GAAGGACGACTCTTGATGCGCCAACCACAGTGACAAT^ 

GATGCTGCTGGAGACGATTCAAAGTTTAGTTTAGAAAGGTCCTCCATTTATGCTGATAGAATACTAGATAC 
CCGTGAACTTTGTCTCAGGAGATCCGCATCAGACG 

CCCCTAGGCAGTTGGTGAGCTTACATGAGAAGTCTAAACTATCTTGGACCCGCTGGTTTTATAAAGGGTTC 
GTTAGGAATGCGTTAACTACCATTCCAGC^UVCATCCGTGGGGCTTCTGGTGTTTGAAATACTGCGTCAAAA
ATTGAGCGATGAAATTGAAGATCGATTCAGTTGAATCGCCCGAAACAATTGATCCCCTGTACATACTTGTA 
ATTTACCTC7VGAATGGGTTAATTAAGGCGCGCCAGATCTGTTTAGCTTGCCTCGTCCCCGCCGGGTCACCC 
GGCCAGCGACATGGAGGCCCAGAATACCCTCCTTG^ 

ACTGTCGCCCGTAC^TTTAGCCCATACATCCCCATGTATAATCATTTGCATCCATACATOT 
ACGGCGCGAAGCAAAAATTACGGCTCCTCGCTGCAGAC

GTTGAATTGTCCCCACGCCGCGCCCCTGTAGAGAAATATAAAAGGTTAGGATTTGCCACTGAGGTTCTTCT 
TTCATATACTTCCTTTTAAAATCTTGCTAGGATACAGTTCTCACATCACATCCGAACATAAACAACCATGG
GTAAGGAAAAGACTCACGTTTCGAGGCCGCGATTAAATTCCAACATGGATGCTGATTTATATGGGTATAAA 
TGGGCTCGCGATAATGTCGGGCAATCAGGTGCGACAATCTATCGATTGTATGGGAAGCCCGA 



GTTGTTTCTGAAACATGGCAAAGGTAGCGTTGCCAATG^ 

TGACGGAATTTATGCCTCTTCCGACCATCAAGCT^^ 

ACTGCGATCCCCGGCAAAACAGCATTCCAGGTATT^ 

TGCGCTGGCAGTGTTCCTGCGCCGGTTGCATTCGATTCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCG 

TATTTCGTCTCGCTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGATGCGAGTGATTTTGATGACGAG 

CGTAATGGCTGGCCTGTTGAACAAGTCTGGAAAGAAATGCATAAGCTTTTGCCATTCTCACCGGATTCAG 

CGTCACTCATGGTGATTTCTCACTTGATAACCTTATTTTTGACGAGGGGAAATTAATAGGTTGTATTGATG 

TTGGACGAGTCGGAATCGGAGACCGATACCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAGTTTTCT 

CCTTCATTACAGAAACGGCTTTTTCAAAAATATGG 

TTTGATGCTCGATGAGTTTTTCTAATC^GTACTGAC^TAAAAAGATTCTTGTTTTC^GAACTTGTC^TT 
TGTATAGTTTTTTTATATTGTAGTTGTTCTATTTTAATCAAATGTTAGCGTGATTTATATTTTTTTTCGCC 
TCGAGATCATCTGCCCAGATGCGAAGTTAAGTGC^ 

CTGGTCGCTATACTGCTGTCGATTCGATACTAACGCCGCCATCCAGTGTCGAAAACGAGCTCGAATTCATC 
GATGATATCAGATCCA 


